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(54) Illumination optimization for specific mask patterns 



(57) A method and apparatus for microlithography. 
The method and apparatus include optimizing illumina- 
tion modes based on characteristics of a specific mask 
pattern. The illumination is optimized by determining an 
appropriate illumination mode based on diffraction or- 
ders of the reticle, and the autocorrelation of the projec- 



tion optic. By elimination of parts of the illumination pat- 
tern which have no influence on modulation, excess DC 
light can be reduced, thereby improving depth of focus. 
Optimization of mask patterns includes addition of sub- 
resolution features to reduce pitches and discretize the 
probability density function of the space width. 



CM 
< 

T— 

CO 
CO 

o> 

CO 
CM 



Fig.1. 



K(a,p) 




Printed by Jouve, 75001 PARIS (FR) 



BNSDOCID: <EP 1 239331 A2J_> 



EP 1 239 331 A2 

Description 

[0001] This invention relates generally to a method and apparatus for microlithographic imaging. More particularly, 
it relates to an apparatus and method for optimizing an illumination configuration according to the specific pattern being 
s imaged. 

Optical lithography systems are in current use in the manufacture of integrated circuits and other fine featured products 
such as programmable gate arrays. In a most general description, a lithography apparatus includes an illumination 
system which provides a projection beam of radiation, a support structure which holds a patterning means, a substrate 
table which holds a substrate, and a projection system (lens) for imaging the patterned beam onto a target portion of 
10 the substrate. 

[0002] The term patterning means should be broadly interpreted as referring to devices and structures that can be 
used to endow an incoming radiation beam with a patterned cross-section, corresponding to a pattern that is to be 
created in a target portion of a substrate; the term "light valve" has also been used in this context. Generally, the pattern 
will correspond to a particular functional layer in a device being created in the target portion, such as an integrated 
15 circuit or other device. 

[0003] One example of such a device is a mask, which is generally held by a (movable) mask table. The concept of 
a mask is well known in lithography, and it includes mask types such as binary, alternating phase-shift, and attenuated 
phase-shift, as well as various hybrid mask types. Placement of such a mask in the projection beam causes selective 
transmission (in the case of a trans missive mask) or reflection (in the case of a reflective mask) of the radiation impinging 

20 on the mask, according to the pattern on the mask. The mask table ensures that the mask can be held at a desired 
position in the incoming projection beam, and that it can be moved relative to the beam if so desired. 
[0004] Another example of such a device is a matrix-addressable surface having a viscoelastic control layer and a 
reflective surface. The basic principle behind such an apparatus is that (for example) addressed areas of the reflective 
surface reflect incident light as diffracted light, whereas un addressed areas reflect incident light as undiffracted light. 

25 Using an appropriate filter, the said undiffracted light can be filtered out of the reflected beam, leaving only the diffracted 
light behind; in this manner, the beam becomes patterned according to the addressing pattern of the matrix-addressable 
surface. An alternative embodiment of a programmable mirror array employs a matrix arrangement of tiny mirrors, 
each of which can be individually tilted about an axis by applying a suitable localized electric field, or by employing 
piezoelectric actuation means. Once again, the mirrors are matrix-addressable, such that addressed mirrors will reflect 

30 an incoming radiation beam in a different direction to unaddressed mirrors; in this manner, the reflected beam is pat- 
terned according to the addressing pattern of the matrix-addressable mirrors. The required matrix addressing can be 
performed using suitable electronic means. In both of the situations described hereabove, the patterning means can 
comprise one or more programmable mirror arrays. More information on mirror arrays as here referred to can be 
gleaned, for example, from United States Patents US 5,296,891 and US 5,523,193, and PCT patent applications WO 

35 98/38597 and WO 98/33096, which are incorporated herein by reference. In the case of a programmable mirror array, 
the said support structure may be embodied as a frame or table, for example, which may be fixed or movable as required. 
[0005] Another example is a programmable LCD array, in which case the support structure can again be a frame or 
table, for example. An example of such a construction is given in United States Patent US 5,229,872, which is incor- 
porated herein by reference. 

40 [0006] For purposes of simplicity, the rest of this text may, at certain locations, specifically direct itself to examples 
involving a mask; however, the general principles discussed in such instances should be seen in the broader context 
of the patterning means as hereabove set forth. 

[0007] The term projection system encompasses various types of projection systems. Though "lens" in a layperson's 
understanding usually connotes only refractive optics, herein this term is used broadly to include catoptric and catadi- 
45 optric systems, for example. The illumination system may also include elements operating according to any of these 
principles for directing, shaping or controlling the projection beam, and such elements may also be referred to below, 
collectively or singularly, as a "lens". 

[0008] Additionally, the term "wafer table" may be used without implying that the substrate receiving the image is a 
silicon wafer, but may rather indicate a stage suited for support of any substrate to be processed by the lithography 
so apparatus. 

[0009] Lithographic projection apparatus can be used, for example, in the manufacture of integrated circuits (ICs). 
In such a case, the patterning means may generate a circuit pattern corresponding to an individual layer of the IC, and 
this pattern can be imaged onto a target portion (comprising one or more dies) on a substrate (silicon wafer) that has 
been coated with a layer of radiation-sensitive material (resist). In general, a single wafer will contain a network of 
55 adjacent target portions that are successively irradiated via the projection system, one at a time. In current apparatus, 
employing patterning by a mask on a mask table, a distinction can be made between two different types of machine. 
In one type of lithographic projection apparatus, each target portion is irradiated by exposing the entire mask pattern 
onto the target portion at once; such an apparatus is commonly referred to as a wafer stepper. In an alternative appa- 
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ratus — commonly referred to as a step-and-scan apparatus — each target portion is irradiated by progressively 
scanning the mask pattern under the projection beam in a given reference direction (the "scanning" direction) while 
synchronously scanning the substrate table parallel or anti-parallel to this direction. Since, in general, the projection 
system will have a magnification factor M (generally < 1), the speed V at which the substrate table is scanned will be 
5 a factor M times that at which the mask table is scanned. More information with regard to lithographic devices as here 
described can be gleaned, for example, from US 6,046,792, incorporated herein by reference. 

[0010] In a manufacturing process using a lithographic projection apparatus, a pattern (e.g. in a mask) is imaged 
onto a substrate that is at least partially covered by a layer of radiation-sensitive material (resist). Prior to this imaging 
step, the substrate may undergo various procedures, such as priming, resist coating and a soft bake. After exposure, 

io the substrate may be subjected to other procedures, such as a post-exposure bake (PEB), development, a hard bake 
and measurement/inspection of the imaged features. This array of procedures is used as a basis to pattern an individual 
layer of a device, e.g. an 1C. Such a patterned layer may then undergo various processes such as etching, ion-implan- 
tation (doping), metallization, oxidation, chemo-mechanical polishing, etc., all intended to finish off an individual layer. 
If several layers are required, then the whole procedure, or a variant thereof, will have to be repeated for each new 

15 layer. Eventually, an array of devices will be present on the substrate (wafer). These devices are then separated from 
one another by a technique such as dicing or sawing, whence the individual devices can be mounted on a carrier, 
connected to pins, etc. Further information regarding such processes can be obtained, for example, from the book 
"Microchip Fabrication: A Practical Guide to Semiconductor Processing", Third Edition, by Peter van Zant, McGraw 
Hill Publishing Co., 1997, ISBN 0-07-067250-4, incorporated herein by reference. 

20 [0011] For the sake of simplicity, the projection system may hereinafter be referred to as the "lens"; however, this 
term should be broadly interpreted as encompassing various types of projection system, including refractive optics, 
reflective optics, and catadioptric systems, for example. The radiation system may also include components operating 
according to any of these design types for directing, shaping or controlling the projection beam of radiation, and such 
components may also be referred to below, collectively or singularly, as a "lens". Further, the lithographic apparatus 

25 may be of a type having two or more substrate tables (and/or two or more mask tables). In such "multiple stage" devices 
the additional tables may be used in parallel, or preparatory steps may be carried out on one or more tables while one 
or more other tables are being used for exposures. Dual stage lithographic apparatus are described, for example, in 
US 5,969,441 and WO 98/40791 , incorporated herein by reference. 

[0012] As illumination systems have evolved from producing conventional to annular, and on to quadrupole and more 
30 complicated illumination configurations, the control parameters have concurrently become more numerous. In a con- 
ventional illumination pattern, a circular area including the optical axis is illuminated, the only adjustment to the pattern 
being to alter the outer radius (o>). Annular illumination requires the definition of an inner radius (a c ) in order to define 
the illuminated ring. For multipole patterns, the number of parameters which can be controlled continues to increase. 
For example in a quadrupole illumination configuration, in addition to the two radii, a pole angle a defines the angle 
35 subtended by each pole between the selected inner and outer radii. 

[0013] Concurrently, mask technology has been evolving as well. Binary intensity masks have given way to phase- 
shifted masks and other advanced designs. While a binary mask simply transmits, reflects or blocks imaging radiation 
at a given point, a phase-shifted mask may attenuate some radiation or it may transmit or reflect the light after imparting 
a.phase shift, or both. Phase-shifted masks have been used in order to image features which are on the order of the 
40 imaging radiation's wavelength or smaller, since diffraction effects at these resolutions can cause poor contrast and 
end-of-line errors, among other problems. 

[0014] The various types of illumination configurations can be used to provide improvements in resolution, depth of 
focus, contrast and other characteristics of the printed image. However, each illumination type has certain tradeoffs. 
For example, improved contrast may come at the expense of depth of focus; each type of mask has a performance 
45 which is dependent on the pattern to be imaged as well. 

[0015] Conventionally, in order to select the optimum illumination mode for a given pattern to be imaged onto a wafer, 
a series of test wafers has been exposed and compared on a hit-or-miss basis. As noted above, modern illumination 
systems have ever increasing numbers of variables which can be manipulated. As the various permutations of variable 
settings increase, the cost of trial and error optimization of illumination configurations becomes very large and quan- 
go titative methods of selecting illumination configurations are needed. 

[0016] To provide for the above identified needs and others, the present invention provides a method of optimizing 
an illumination profile for a selected patterning means pattern, comprising the steps of: 

defining a transmission cross coefficient function for an optical system including an illuminator and the selected 
55 patterning means pattern; 

determining relative relevance to imaging of diffraction orders based on the selected pattern; and 
calculating an optimized illumination configuration from the transmission cross coefficient function, weighting re- 
gions of the illumination configuration based on the relative relevance to imaging of the diffraction orders. 



3 



BNSDOCID: <EP 1 239331 A2_l_> 



10 



EP 1 239 331 A2 

[0017] According to another aspect of the present invention there is provided a device manufacturing method com- 
prising the steps of: 

(a) providing a substrate that is at least partially covered by a layer of radiation-sensitive material; 

(b) providing a projection beam of radiation using an illumination system; 

(c) using patterning means to endow the projection beam with a pattern in its cross -section; 

(d) projecting the patterned beam of radiation onto a target portion of the layer of radiation-sensitive material, 
wherein, prior to step (d), the cross-sectional intensity distribution in the projection beam produced in step (b) is 
tailored to the pattern employed in step (c) using a method as described above. 

[0018] According to another aspect of the present invention there is provided a lithographic projection apparatus 
comprising: 

an illumination system for providing a projection beam of radiation; 
'5 - a support structure for supporting patterning means, the patterning means serving to pattern the projection beam 
according to a desired pattern; 
a substrate table for holding a substrate; 

a projection system for projecting the patterned beam onto a target portion of the substrate, 

20 wherein the apparatus additionally comprises: 

calculating means, for defining a transmission cross coefficient function for the illuminator and the desired pattern, 
determining relative relevance to imaging of diffraction orders based on the pattern produced by the patterning 
means, and calculating an optimized illumination configuration from the transmission cross coefficient function, 
25 weighing regions of the illumination configuration based on the relative relevance to imaging of the diffraction 

orders; and 

selecting means, for selecting the cross-sectional intensity distribution in the projection beam exiting the illumina- 
tion system in accordance with the illumination configuration calculated by the calculating means. 

30 [001 9] According to a yet another aspect of the present invention there is provided a method of optimizing a selected 
mask design comprising: 

identifying critical features of the selected mask design; 

determining an optimized illumination profile based on diffraction orders of the critical features; and 
35 modifying the selected mask design by use of optical proximity correction techniques which are selected to reduce 

a number of pitches present in the selected mask design. 

[0020] The present invention further provides computer programs for carrying out the methods described above. 
[0021] Although specific reference may be made in this text to the use of the apparatus according to the invention 

40 in the manufacture of ICs, it should be explicitly understood that such an apparatus has many other possible applica- 
tions. For example, it may be employed in the manufacture of integrated optical systems, guidance and detection 
patterns for magnetic domain memories, liquid-crystal display panels, thin-film magnetic heads, etc. The skilled artisan 
will appreciate that, in the context of such alternative applications, any use of the terms "reticle", "wafer" or "die" in this 
text should be considered as being replaced by the more general terms "mask", "substrate" and "target portion", re- 

45 spectively. 

[0022] The present invention will be described further below with reference to exemplary embodiments and the ac- 
companying drawings, in which: 

Figure 1 is a diagram of the transmission cross coefficient function for a generalized image forming system; 
50 Figure 2 is an example of a brick wall isolation pattern microlithographic mask feature; 

Figure 3 is a representation of the diffraction orders of the mask feature of Figure 2; 

Figure 4 is a map of the calculated optimized four dimensional illumination configuration for the mask feature of 
Figure 2; 

Figure 5 is a calculated starting gray scale illumination configuration (Jj ot ) for the mask feature of Figure 2; 
55 Figure 6 is a binary representation of the illumination configuration of Figure 5; 

Figure 7 shows analysis of a print of the mask feature of Figure 2 printed with an annular illumination configuration; 
Figure 8 shows analysis of a print of the mask feature of Figure 2 printed with an optimized elliptical illumination 
configuration; 
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Figure 9 is a map of the calculated optimized four dimensional illumination configuration for the mask feature of 
Figure 2 scaled to 110nm design rules; 

Figure 10 is a calculated starting gray scale illumination configuration for the mask feature of Figure 2 scaled to 
110nm design rules; 

5 Figures 11a and 11b are binary representations of the illumination configuration of Figure 10 with differing values 

of o; 

Figure 12 is an example of a mask pattern with critical gates and cells indicated; 

Figure 1 3 is the mask pattern of Figure 1 2 with assist features added to reduce the number of pitches in the pattern; 
Figure 1 4 compares probability density functions of space widths of the mask patterns of Figures 1 2 and 1 3; and 
w Figure 15 is a schematic representation of an apparatus for microphotolithography. 

[0023] In the various Figures, like parts are identified by like references. 

[0024] The present invention involves first mathematically modeling the imaging of the pattern onto the substrate (e. 
g. from a mask), taking into account the illumination source and the pattern details. 

*5 [0025] There are two primary methods for calculating the aerial image for a finite illumination source. These methods 
are Abbe's formulation and Hopkins's formulation. In Abbe's formulation, each point source in the illumination config- 
uration produces a plane wave incident onto the pattern, and each of these point sources is imaged onto the wafer. 
Since the source is spatially incoherent, the total intensity at the wafer is the summation of the intensity produced by 
each of these point sources. Therefore, in Abbe's formulation, the integration over the illumination configuration is 

20 performed after the integration over the pattern. 

[0026] In Hopkins's formulation, the order of integration is switched, i.e., the integration over the source is performed 
first. In Hopkins's formulation, a four dimensional transmission cross coefficient (TCC) is defined, and the image in- 
tensity is the inverse Fourier transform of the TCC. A derivation of the TCC is described, for example, in Born and 
Wolf, Principles of Optics, 6 th Ed., pp. 528-532, herein incorporated by reference. 

25 [0027] The TCC is the autocorrelation of the projection pupil multiplied by the illumination pupil. The TCC is shown 
in Figure 1 as a set of three overlapping circles. From left to right, the first circle represents the illumination pupil J s (a, 
P) where a and p are coordinates of the illumination configuration. For the purposes of the following calculations, the 
radius of J s may be, for example, set to the maximum allowable outer o r for the lithography apparatus which will be 
used for imaging. It is also possible to set o> to 1 .0 or larger in order to perform feasibility studies and to determine the 

30 benefits of a larger o r . 

[0028] The central circle represents the projection pupil, K(a,p) that is centered at (-mA/P x NA, -nA/P y NA). The coor- 
dinate systems are normalized by a factor of X/NA so that the radius of K is 1 .0. The circle on the right likewise represents 
the projection pupil; however, it is centered at (pAyP x NA, qA/P y NA). In these last two expressions, m,n,p, and q corre- 
spond to discrete diffraction orders and it becomes clear that the TCC is a four dimensional (4-D) equation as described 
35 above. The diffraction orders in the x-direction are represented by m and p and the diffraction orders in the y-direction 
are represented by n and q. Though for purposes of this description, x and y coordinates are used, one skilled in the 
art understands that alternate coordinate systems could be used with appropriate changes of coordinate systems in 
the following equations. 

[0029] The TCC for a 4-D discrete point (m,n,p,q) is the integral of the shaded area where all three circles overlap. - 
40 Since the structure is assumed to be periodic, the Fourier transform of the pattern is discrete and the TCC is discrete. 
For a continuous pattern image, the pitch can be increased until an adjacent feature has no influence on the Fourier 
transform of the pattern of interest. The TCC in Figure 1 is described mathematically in Equation 1 . 



45 
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TCC(m r n,p,q) = 
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[0030] The TCC may be expanded to include the effects of the pattern by defining diffraction order cross coefficients 
(DOCC). The DOCC are defined in Equation 2 which is derived from the multiplication of the TCC by the Fourier 
55 transform coefficients of the pattern. 

DOCC(m,n,p,q) = T(m, n) 7* (- p,-q) TCC(m,n,p,q) Eqn. 2 
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[0031] Further, the radiation intensity at the wafer may be calculated by the inverse Fourier transform of the DOCC, 
as shown in Eqn. 3. 



'<*>)- EZEE^""*" , W g< "" , W C (..».,.,) Eqn. 3 



[0032] The projection optical system acts in part as a low pass filter, which reduces the diffraction orders so that only 
a few of the diffraction orders are important to the calculated image intensity. As a result, the TCC is a band limited 
function. The maximum necessary x and y orders can be calculated according to Equations 4 and 5 respectively. In 
each case, both the negative and positive orders are necessary, for example m extends from negative m max to positive 
m max (- m max ^ m < +m mmax ). Since both negative and positive orders are needed, the size of the TCC is 2m max +1 by 
2n max+ 1 b y 2 Pmax +1 b Y 2 q m ax +1 • Fortunately, however, because the TCC is band limited, it is not necessary to calculate 
all the pattern diffraction orders. Like in the TCC, only pattern diffraction orders -m max < m < +m max in the x direction 
and orders -n max < n < +n max in the y direction are needed. 



20 
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Eqn. 4 
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Eqn. 5 



[0033] Substituting Equations 1 and 2 into Equation 3 gives Equation 6 for the radiation intensity at the wafer. By 
switching the order of integration, as shown in Equation 7, that is, by using Abbe's formulation rather than Hopkins's, 
the portions of the illumination pupil which are most influential on imaging may be determined. Note that each of Equa- 
tions 6 and 7 extends across two lines. 
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Eqn. 7 
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[0034] Since a and p represent illumination pupil coordinates, a new function, J opt , may be defined. The new function 
J opt indicates which part of the illumination configuration (cc,p) is being used for a given diffraction order (m,n,p,q) and 
is expressed in Equation 8. From Equation 8, the image intensity can be calculated by multiplying it by the inverse 
Fourier coefficient (e**) and summing over all 6 variables (m,n,p,q,a,0) as shown in Equation 9. 
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J o P t («> A ™> n,p,q)=J s (a, fi)r(m, n)T* (- p-q)K 
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Eqn. 8 
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[0035] As will be appreciated, J opt is a six dimensional function and it is therefore difficult to apply it to the illumination 
configuration. In order to best determine which portions of the illumination configuration are significant to image for- 
mation, it is desirable to eliminate a few of the six variables. 

[0036] The aerial image intensity, l(x } y), is found by taking an inverse transform over m+p and n+q. When 
m+p=n+q=0, there is no modulation in the aerial image intensity. Since one of the goals of illumination optimization is 
to eliminate parts of the illumination configuration that have little or no influence on modulation, those portions of the 
illumination configuration for which m+p = n+q = 0 may be eliminated. In order to eliminate these parts and to better 
visualize the illumination configuration portions significant to image formation, a transformation of variables will elimi- 
nate two of the variables in the six dimensional J opt function (four diffraction orders) and convert it into a four dimensional 
function (two diffraction orders). The four dimensional function is called J opt -2D- Bv substituting Equations 10 and 11 
into Equation 9 for l(x,y), Equation 12 may be derived. 
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Eqn. 10 



£ =n + q=> q = c, -n 



Eqn. 11 
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Eqn. 12 



[0037] In Equation 12, J opt _ 2 D can be s © en to be the summation of J opl over rn and n after the transformation of 
45 variables according to Equations 10 and 11. By further substituting Equation 8 into Equation 12, J opt . 2D ma Y De ex- 
pressed as in Equation 13 and the intensity, l(x,y), can be written as a function of J opt . 2 D as jn Equation 14. 
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Eqn. 13 
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Eqn. 14 



5 



[0038] The function , J opt -2D' wn © n evaluated, shows those portions of the illumination configuration that are important 
for each diffraction order. Since J opt . 2 D is weighted by each diffraction order, T(m,n), large diffraction orders will have 

10 a greater influence on the aerial image. 

[0039] A starting point for the best illumination configuration for a particular pattern may be denoted J tot and is found 
by summing J opt . 2 D over t| and £ and subtracting J op t-2D( a 'M= 0 >£= 0 ) as shown in Equation 15. In Equation 15, when 
t|=0 and £=0 there is no modulation in the aerial image and the J O pt-2D( a >M= 0 £=°) component represents zero order 
or DC light. Points in the illumination that do not contribute to imaging increase the overall amount of DC light. Since 

15 the increased DC light causes no modulation it is not of great benefit, and moreover it can result in a reduction in depth 
of focus. 

[0040] Thus, an illumination configuration in accordance with J tot minimizes the amount of DC light and results in an 
improved process window. The equation J tot can be used to show which parts of the illuminator are more significant 
(or less significant) to image formation. 



[0041] Since the illumination configuration and pattern are coupled, optical proximity correction (OPC) changes in- 
fluence the diffraction orders, which therefore influences J tot . Consequently, modifications to the starting illumination 
configuration, J toti and the pattern should be performed a few times using iterations of processing with an OPC engine 
and an illumination engine, as will be understood by one skilled in the art. Furthermore, the pattern and illumination 
configuration also need to be adjusted to optimize a particular imaging criteria (depth of focus (DOF), end of line (EOL), 
aberration sensitivity, etc.) which may be performed with optimization software. However, since the pattern as a whole, 
rather than the OPC features, has the largest impact on the optimal illumination configuration, J tot is the best initial 
illumination configuration to lead to the fastest convergence for optimizing iterations over the illumination configuration 
and pattern. 

[0042] The starting illumination configuration, J tot , may be represented by a gray scale illumination configuration 
having continuous values of intensity over a 0 to 1 range. It is possible to create such a gray scale illumination config- 
uration with a diffractive optical element (DOE) or by using a quartz plate with dithered chromium plating. If a gray 
scale illumination configuration is not possible or preferred, the illuminator profile can be forced to be onlyO and 1 by 
applying a threshold to the gray scale, in which values above the threshold are rounded up to 1 and values below the 
threshold are rounded down to 0. An arbitrary threshold can be applied, or an optimal threshold may be found through 
simulating the process window, or by repeated test runs. 

[0043] Example 1 : The technique for calculating J tot outlined above was applied to a brick wall isolation pattern. A 
150nm pattern was shrunk to 130nm and 110nm design rules and imaged with a step and scan lithography system 
having a numerical aperture (NA) of 0.8. The isolation partem for the 130nm design rule is shown in Figure 2. 
[0044] The magnitudes of the diffraction orders of this mask feature are plotted in Figure 3. In Figure 3, the largest 
order is the (0,0) order or the DC background light. The orders that contribute the most to imaging are the (±2,0) orders 
and represent the vertical bricks in the brick wall partem. The other significant order is the (±1 ,±1) which represents 
the clear areas and defines the end of the isolation pattern. The higher orders also help to define two dimensional 
structures such as the end of each line. Since the diffraction orders are not constant, the orders change the weighting 
coefficients in the DOCC, which implies that the mask pattern influences the illumination strategy. 
[0045] The diffraction order coefficients T(m,n) in Figure 3 can be substituted into Equation 13 to calculate J^^d 
and are plotted in Figure 4. As can be seen from Figure 4, the largest contribution to J^t^D is tne Cn=0£=0) order. The 
(0,0) order does not contribute to imaging and decreases the DOF. As Equation 15 shows, this (0,0) order can be 
subtracted from the total illumination, J tot . Not considering the (0,0) order, the largest contribution is the (r|=±2,£=0) 
diffraction order, which represents the formation of the isolation lines along the x-direction. Another component that is 
large and defines the end of the isolation lines is the (t|=±1 £=±1 ) diffraction order. Although the (0,±2) diffraction order 
is rather small, higher orders combine in the T|=0 and ^=±2 region of the lens. These regions also help to define the 
end of line. The DOCC approach shows how the illumination pupil is sampled for improved image formation and is an 
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effective method for understanding imaging of the brick wall isolation pattern. 

[0046] Using Eq uation 1 5, the illumination pupil of the brick wall pattern for the 1 30nm design rule can be calculated 
and is shown in Figure 5. Figure 5 shows that the most significant areas for image formation are the outer portions of 
the illumination configuration along the x-axis. These outer portions form an elliptical dipole. In addition to these elliptical 
5 dipole elements, the center of the illumination pupil has a large contribution to image formulation. As noted above, the 
illumination pupil can be implemented in gray scale or binary illumination profiles. 

[0047] Depending on the apparatus being used, gray scale illumination may be possible. By gray scale illumination 
is meant controllable illumination intensity for which a normalized level ranging from 0 to 1 may be selected for at least 
given portions of the illumination configuration. For example, such control over illumination intensity may be produced 
10 by use of a diffractive optical element (DOE) in the illumination system. In this case, for example, the illumination 
configuration can be implemented as shown in Figure 5. However, some of the local spikes which are calculated in 
theory and seen in Figure 5 will be removed after low pass filtering of the illumination information as a result of the 
projection optics, as discussed above. Therefore, when designing an illumination configuration, spikes which will be 
filtered should be ignored. 

15 [0048] If a binary illumination configuration is used, i.e., only binary values for intensity of the illuminator are allowed 
(0 or 1 ), a threshold value should be chosen as a basis for assigning values of 0 or 1 to each point on the illumination 
configuration. For example, if a threshold of 0.8 is chosen, illuminator intensity values above 0.8 are rounded up to 1 
and values below 0.8 are rounded down to 0. Other threshold values may be applied as desired. 
[0049] Example 2: Using the gray scale to binary approach, a binary illumination configuration for the same brick 

20 wall isolation pattern was designed assuming a maximum outer a of 0.88 and is shown in Figure 6. 

[0050] The performance of the optimized illumination configuration in Figure 6 was then simulated for binary mask 
on a step and scan photolithography apparatus having NA=0.8 and X=248nm and compared to the simulated perform- 
ance of annular illumination. In the simulation, the vector (thin-film) imaging resist model was used since the numerical 
aperture was above 0.7. In this model, the resist is 400nm thickness of a type having a refractive index n=1 .76-J0.01 1 6), 

25 over 66nm of another type having n=1 .45-j0.3 on top of a polysilicon material having n=1 .577-j3.588. The results with 
the annular illumination (o jn =0.58 and of o out =0.88) and with the optimized illuminator (o out =0.88) are shown in Figures 
7 and 8, respectively. In both Figures 7 and 8, cross section results in the middle of the isolation region and top down 
simulation results are shown. In the Figures, the Bossung plot B from aerial image threshold is calculated by averaging 
the intensity through the resist, and the resultant linewidth, Iw, is plotted versus focus, f, for a threshold intensity. This 

30 technique tends to over predict the DOF as thickness loss and resist profile slope is not taken into consideration. A 
resist model that at least calculates thickness loss is probably necessary. In each of the figures, the top down results 
are plotted as solid, curved lines at the best threshold (best dose) as calculated by the Bossung plot. These simulated 
threshold images are compared to actual mask data shown in dashed, straight lines. 

[0051] Simulation results for the 130nm design rule brick wall isolation pattern are plotted in Figure 7 for a binary 
35 mask feature with NA of 0.8 using annular illumination (o in =0.58 and of a out =0.88). This annular setting has approxi- 
mately 0.4u.m of DOF from -0.4u.rn to O.Oujti focus. The contrast of the resist is low through focus, and can be imaged 
with a low contrast resist. However, at this low intensity contrast, the mask error enhancement factor (MEEF) is large 
and the exposure latitude (EL) is small. The top down images in Figure 7 also show that there is approximately 20nm 
of end of line (EOL) shortening, which can be fixed by extending the line slightly for the 130nm design.rule. However, 
as the design rule continues to shrink, extending the line is no longer feasible as the extended line may conflict with 
other features. Therefore, it is desirable to fix the EOL with the illumination. 

[0052] In Figure 8, simulation results for the 130nm design rule brick wall isolation pattern are plotted for a binary 
mask feature with NA of 0.8 and using the optimized binary illumination configuration in FIG. 6. The optimal illumination 
configuration has approximately 0.6u.m DOF from -0.45u.rn to +0.15u.m focus. In comparing the cross section images 

45 jn Figure 8 to those in Figure 7, the optimized illumination configuration has a larger contrast through focus as compared 
to annular illumination. This larger contrast implies that the MEEF for the optimized illumination configuration is lower 
compared to annular illumination and that the exposure latitude for the optimized illumination configuration is higher. 
Another benefit of the optimized illumination configuration is the improved line end performance as compared to annular 
illumination. The top down images in Figure 8 show that the optimized illumination configuration is capable of main- 

so taining the EOL without extending the line on the pattern, which is advantageous for more aggressive design rule 
shrinks. 

[0053] Example 3: The results in Figures 7 and 8 for binary mask (BIM) were compared to simulation results for 
chromeless mask (CLM). A chromeless brick wall isolation pattern was designed from experimental results of software 
simulation in a manner known to those skilled in the art. The chromeless technology requires (0,0) order light so as to 
55 fully benefit from the DOF improvement produced by off axis illumination. Experimental results from the simulation 
confirm the need for (0,0) order light for which purpose the isolation layer should be dithered or half toned. The half 
tone pitch may be chosen such that the first order in the dithered direction does not fall into the projection pupil. In the 
example, the lines were dithered in the vertical direction with pitch less than Ay[NA(1+a out )]. The dithering duty cycle 
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however should be tuned to optimize the amount of (0,0) order light for best DOF and pattern fidelity. In the simulation 
results for CLM the halftone pitch was 155nm with 50% duty cycle (77.5nm chrome islands). This pitch substantially 
prevented the (0,±1) orders from entering the projection pupil; however, this duty cycle should be tuned to maximum 
DOF with computer aided design tools. 

5 [0054] Example 4: Simulation results for the 130nm design rule layer were plotted for a CLM with 155nm halftone 
pitch and 50% duty cycle. The CLM was exposed on a X=248nm apparatus with NA of 0.8 and annular illumination 
(a ln =0.58 and a out =0.88). The CLM with this annular setting had 0.5u,m DOF (-0.4u.rn focus to +0.1 u,m focus). The CLM 
with annular illumination had greater DOF and better contrast through focus as compared to BIM with annular illumi- 
nation. This indicates that the CLM performed better than the BIM mask. The top down simulation results indicated 

10 that the EOL performance with CLM is theoretically better than the EOL performance with BIM and that the CLM was 
also able to better define the contact hole landing area as compared to BIM. 

[0055] Example 5: Simulation results for 1 30nm brick wall isolation pattern isolation layer were plotted for X^248nm 
apparatus with NA of 0.8 and the optimized elliptical dipole shown in Figure 6. These results were simulated with a 
reticle identical to the CLM reticle used in the preceding example, which has a 155nm halftone pitch and 50% duty 

15 cycle. The CLM exposed with this optimized illumination configuration had 0.7u,m DOF (-0.5u,m to +0.2ujti), an im- 
provement of 40%. The Bossung plots indicated that the isofocal intensity was around 0.21. A model based OPC 
approach could additionally be applied in order to tune the reticle to size at the correct linewidth, providing further 
improvements in performance. The linewidth may be corrected, for example, by biasing and by modifying the half tone 
duty cycle. The top down simulation results indicated that CLM was able to define the contact landing region and was 

20 able to maintain CD uniformity. Necking and other linewidth inconsistencies were reduced with this elliptical illumination 
configuration. Furthermore, the CLM reticle could be biased to improve the DOF, and consequently the EOL perform- 
ance should improve. Furthermore, model based OPC should be able to correct the EOL further. 
[0056] Example 6: Using the mask pattern of Figure 2 for a 1 1 0nm design rule isolation layer, an optimized illumination 
configuration was generated with Equations 13 and 15. In order to visualize sampling of the illumination pupil, Jop t . 2 D 

25 is plotted in Figure 9, showing x orders (T|=m+p) horizontally and y orders (£=n+q) vertically. As in Figure 4 for the 
130nm design rule, the largest contribution to 110nm design rule in Figure 11 is the (T|=0, £=0) order. This (0,0) order 
light is detrimental to the DOF and is eliminated in J tot as indicated in Equation 15. Figure 9 also shows that the (±1 ,±1) 
orders are the largest contributors to the illumination configuration optimization rather than the (±2,0) order. This is 
due to the fact that 11 Onm design rule is too aggressive for the 248nm apparatus with NA=0.8, as slightly higher NA 

30 js preferred to achieve this resolution. The orders that contribute most to defining the isolation linewidth are the (±2,0) 
orders. The (±2,0) orders, however, are at the far edge of the illumination configuration (0.8<a<1 .0), which indicates 
that a of 1 may provide improvement to implement the 11 Onm design rule at this wavelength. 

[0057] Using Equation 15 and the results in Figure 9, the optimized illumination configuration for the 110nm brick 
wall isolation layer is shown in Figure 10. Figure 1 0 shows that the illumination configuration areas that contribute most 
35 to image formation are a small portion in the center and far edges of the illumination configuration. One possible 
implementation of this illumination configuration is plotted in Figure 11a. In order to use 248nm apparatus to print more 
aggressive design rules and push the limit of the projection numerical aperture, an illumination configuration with o of 
1 .0, as shown in Figure 11b, and with small sectors (a ring width of 0.2) may be used. 

[0058] An implementation of the present invention includes selection of cells or particular gates that are critical. 

40 These critical features are then processed to determine J tot as described above. In Section 1, it was shown that the 
illumination configuration is pattern dependent. Therefore, if the pitch does not differ significantly for the critical features, 
it is possible to create a single illumination configuration which optimizes the process window for all the critical features. 
In Figure 12, an example of a circuit with critical gates g 1f g 2 , g 3 and a critical cell cc is shown. The diffraction orders 
of these tagged critical features can be calculated, and by using the theory already described, the optimized illumination 

45 configuration can be calculated. After calculating the optimized illumination configuration the process window can be 
calculated and compared to the process window with other illumination configurations. 

[0059] Another method of optimizing illumination/pattern interaction is to modify the pattern design with scattering 
bars. Scattering bars discretize the pitch from a semi-continuous function for an ASIC or logic design. After placing 
scattering bars, there are fewer pitches. This can be demonstrated in simulation software by placing the scattering 

so bars with an edge-to-edge separation of 0.61 X/NA. In Figure 13, the design in Figure 12 has been modified by adding 
a plurality of scattering bars. The illumination configuration can then be optimized for the modified design. The process 
window performance of an illumination configuration optimized for a design with scattering bars can then be compared 
to the process window of an illumination configuration optimized without scattering bars. Since a design with scattering 
bars discretizes the pitch, the combination of scattering bars with optimized off axis illumination (OAI) will have the 

55 largest possible DOF process window. 

[0060] Another concept for implementing illumination configuration optimization is through the placing of scattering 
bars based on space width (SW) considerations. A scattering bar is placed through rule based OPC which rules may 
be defined by the space width. With simulation software, it should be possible to calculate the probability density 



10 



BNSDOCID: <EP 1 239331 A2_l_> 



i 



EP 1 239 331 A2 



function (pdf) of the space width without scattering bars and with scattering bars. The illumination can than be optimized 
by considering the pdf by modifying J op t. 2 D as shown in Equation 16. Assuming that the vertical lines and horizontal 
lines are infinite, it is also possible to calculate the diffraction orders T(m,n). In Equation 17, the diffraction orders are 
calculated as a function of m and n where w is the line width, x is the intensity transmittance of the reticle, and P x =SW x +w 
and P y =SW y +w are the pitches in the x and y directions, respectively. 



■/.M) Z X \\dP x dP y pdf{Px)pdf(Py)T{m,n)r[-h-m),-k-n)] 
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P r NA 



P y NA 



Eqn. 16 



20 [0061] Equation 17 is a matrix of four equations for which, in order of presentation, m=n=0; m=0, n*0; m^O, n=0; 
and m*0, n*0. 



[0062] Calculating the optimal illumination configuration with the pdf can present some problems as it implies that 
some pitches are not as important as others. If all the gates in the pdf are considered to be critical, the pdf should be 
modified by a weighting factor. This weighting factor is a function of pitch called wf(Px). With this weighting factor, all 
the critical pitches should be treated the same such that wf(Px)»pdf(Px)=1 . This weighting factor should be added to 
Equation 1 6 by replacing pdf(Px) in Equation 1 6 with wf(Px)»pdf(Px). In the case that all of the pitches are critical, the 
weighting factors will not help to resolve the optimization and it is difficult to create an optimized illumination configu- 
ration without modifying the (pattern) design. 

[0063] One solution to this problem is to modify the design by adding scattering bars as discussed above. Scattering 
bars help to reduce pitches for isolated features. Once scattering bars are added to the design, the previously isolated 
features tend to act like dense features. Thus, scattering bars discretize the pitch from a continuous pdf to a more 
discrete pdf. Figure 14 is an example pdf for a logic pattern with features oriented in the y-direction (i.e. "vertical" 
direction) in which scattering bars have and have not been applied. Figure 14 shows vertical gate space widths (jam) 
on the x (horizontal) axis. For the unmodified design, D, without scattering bars, there are three discrete humps in the 
pdf at space widths of 0.2, 0.6 and 1 .5u.m. After the placement of scattering bars, D+SB, the number of pitches has 
been reduced such that most of the space widths are at dense pitches of 0.2u.m. With this change to the pdf, it is more 
likely that an illumination configuration can be optimized. 

[0064] The total illumination configuration for a design with both horizontal (x-direction) and vertical features is the 
sum of horizontal and vertical illumination configurations. If the illumination configuration is concentrated at o cx for the 
vertical features and is concentrated at for the horizontal features, the optimal illumination configuration will be a 
"conventional" quadrupole illumination configuration provided that j2o C) £ 1 and that j2o cy <1. Otherwise, this type 
of analysis will result in a four pole illumination configuration that has been rotated 45° 

[0065] The illumination technique presented herein may be extended to account for aberrations. Including aberrations 
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allows an operator to determine which part of the illumination configuration couples to the aberration. The amount of 
coupling is directly related to sensitivity of the image intensity to the aberration. By understanding this coupling, it may 
be possible to modify the illumination configuration to minimize the aberration sensitivity of a design. 
[0066] The projection pupil, K(a,p), for scalar imaging contains the obliquity factor, defocus, and the exponential of 
the wavefront represented by the Zernike polynomials. This scalar imaging pupil is shown in Equation 18. This pupil 
can further be divided into two parts, the unaberrated pupil K 0 (cc,P) and the aberrated pupil (the exponential of the 
wavefront); these two parts are multiplied together as shown in Equation 19. 
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Eqn. 18 
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Eqn. 19 
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Eqn. 20 



defocus 
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v=5 



Eqn. 21 



40 
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+ ... = 1 + X 



Eqn. 22 



[0067] From Equation 22, the wavefront can be written as a linear approximation, which is shown in Equation 23. 
By substituting Equation 23 into Equation 22, the linear approximation for the projection pupil, K(cc,p), can be calculated 
45 with Equation 24. 
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Eqn. 24 
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[0068] Since the TCC is a function of the projection pupil, K(a,p), the linear approximation to the pupil in Equation 
24 implies that the TCC can be represented by a linear approximation. This is accomplished by substituting Equation 
24 into Equation 1 , which results in Equation 25. Once again by neglecting the terms of power 2 or greater, the TCC 
in Equation 25 can be simplified as shown in Equation 26. 

[0069] The wavefront, W(a,p), is most often denoted by the summation of Zernike fringe polynomials as shown in 
Equation 21 . Using the linear theory of aberrations, the exponential, e x , can be represented by a Taylor series expan- 
sion. The Taylor series expansion is valid for small x, and previous work has shown good agreement for aerial images 
when Z v is less than 0.04X. The Taylor series expansion for e* is shown in Equation 22, In Equation 22, terms of power 
2 or greater have been dropped, which is valid provided that 2^ is less than 0.04 (0.04 2 =0.0016 and is negligible). 
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Eqn. 25 
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Eqn. 26 



[0070] By defining the unaberrated TCC, TCC 0 (m,n,p,q), and the aberrated TCC, TCC v (m,n,p,q), in Equations 27 
and 28, respectively, the TCC can be represented by a linear function of TCC 0 and TCC V as shown in Equation 29. 



TCC 0 (m,n >P9 q)= JJ J s (a,fi)K 0 

a 2 +0 2 «x 



a _ pX q x 

K P X NA P r NA } 



dad/3 
Eqn. 27 



TCC v (m^,p,q)^-i 2 ^ JJ ^(a.^Ko 



P X NA*' P y NA 



Eqn. 28 



rCC(w, n,p,q)= TCC 0 {» h n,p y q) + J^Z V \rCC v {m 9 n, p 9 q)+ TCC* V (- p-q -m -n)} 

v=5 

Eqn. 29 

[0071 ] Since the TCC can be constructed as a linear approximation as shown in Equation 29, J opt can also be written 
as a linear approximation. The linear approximation to J opt is derived in Equation 30 by using Equation 8 for J opt and 
by following the methodology for the linear approximation of the TCC as outlined in Equations 1 8 through 29. 
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Eqn. 30 
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[0072] Equation 30 for J opt can then be divided into a summation of the unaberrated J opt0 with the aberrated 
as shown in Equation 33. The definition of J opt0 and are shown in 
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Eqn. 32 



35 [0073] Equation 32 describes the portion of the illumination configuration which couples to a particular aberration. 
The amount of coupling affects the image intensity and helps to provide an understanding of aberration sensitivity to 
illumination. By combining Equations 31 and 32, J opt can be written as a linear approximation. 
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J o P t ( a > P> ™ m <i) = J op to(<x> Pi ™> n> p> v)+ Z z v P> m > n > Pi q)+ J l P tvi a i P-p >-q 

Eqn. 33 

[0074] In another aspect of the present invention, weighting factors can be introduced to maximize or to minimize a 
response to a particular metric including, for example, depth of focus (DOF), image log slope (ILS), image slope (IS), 
or aberration sensitivity. The optimal J tot of Equation 15 can be modified to include these weighting factors as shown 
in Equation 34. 

m n p q 

[0075] In general, photoresists react in proportion to the logarithm of the intensity of the light impinging thereon. As 
the intensity, and therefore the logarithm of the intensity, increases, the feature will be printed into the resist with better 
fidelity (i.e. improved resist profile and improved process window). Therefore, it is desirable to maximize the logarithmic 
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change in intensity (!LS). The ILS is defined in Equation 35. 



c din/ 1 d/ r- 



[0076] Since the derivative of the intensity changes faster than the inverse of intensity, Equation 35 will increase 
more by increasing the derivative of the intensity. The intensity can be calculated from Equation 3 and the derivative 
of the intensity with respect to x is defined in Equation 36. The derivative with respect to x results in the weighting 
function, w x , as shown in Equation 37. Likewise, a weighting function w y can be defined with respect to y as shown in 
10 Equation 38. 

& X m n p q * x 

= ZZ£5»."> U J * U ] DOCC{m,n.p, q ) 



20 m » P Q 



Eqn. 36 



25 w x = t=£(m + p) Eqn. 37 

' X 

iv y -/ — (n + q) Eqn. 38 

30 y 

[0077] Since pattern features and intensity features are two dimensional, the norm of the gradient may be used to 
indicate the change in intensity with respect to position. The norm of the intensity gradient is defined in Equation 39. 
This allows us to define a weighting function to calculate J tot in Equation 34. The weighting function to maximize the 
35 image log slope is defined by Equation 40. 



40 



w ILS (rr>,n,p,q) = 2* J(^ff + (^ff Eqn. 40 

[0078] Equation 40 shows that when m+p=0 and n+q=0, the weighting function becomes 0. When m+p=0 and n+q=0, 
45 these orders contribute nothing to image modulation and reflect DC contributions to the image. Furthermore, W, LS 
increases as m+p and n+q increase. This implies that higher order diffraction order terms are more highly weighted 
and contribute more to ILS. 

[0079] In addition to maximizing ILS, the depth of focus of the process will increase if the ILS is improved such that 
the intensity response to focus is minimized. The focus is accounted for by the pupil K(cc,p). The pupil, K(a,p), is shown 
50 in Equation 41 , where focus is denoted z. Equation 41 can be divided into two terms, terms dependant on z (the defocus 
term) and terms independent of z (the non-defocus term), as shown in Equation 42. 
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Eqn. 41 
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15 [0080] The variation in intensity due to focus, z, can be minimized by setting the derivative of intensity with respect 
to z to zero. By substituting Equation 42 into Equations 1 through 3, a cost function f(a,0,z) can be defined as shown 
in Equation 43 which is the cost function of the intensity imaging terms which depend on z. 
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Eqn. 43 
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[0081] The cost function, f(a,p,z), is in turn minimized when g(a,p,m,n,p,q) equals zero (see Equation 44, below). In 
Equation 44 the phase terms have been removed as the derivative with respect to z equals zero only when the mag- 
nitude terms equal zero. When g(a,p,m,n,p,q) is zero, the areas (ot,p) of the pupil for a given order (m,n,p,q) are 
minimally sensitive to focus. These are the most desired areas of the pupil for constructing the illumination configuration. 
A weighting function, W focus (a,0,m,n,p,q), is defined in Equation 45. This weighting function is equal to 1 for areas 
least sensitive to focus and is equal 0 for areas most sensitive to focus. A new weighting function which maximizes 
ILS through focus can then be defined by Equation 46 and used to modify the illumination configuration. 
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Eqn. 44 
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w focus(° L >$> m > n >P> c l) = 1 " IP(«,P,m,n f p,q)l 



Eqn. 45 
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w(a 5 p,m,r7,p,q) = w ILS (m,n,p,q)w focus (a,0,ro, n, p,q) 



Eqn. 46 



[0082] The above methodology allows the sensitivity of the intensity to be minimized to the influence of focus, an 
aberration. Since the impact of focus on intensity can be minimized, the impact of intensity can be minimized to a 
specific aberration. This is desirable for certain patterns which demonstrate a high sensitivity to a particular aberration. 
The projection pupil in Equation 19 can be written as an unaberrated term, ^(a,^), multiplied by an aberrated term, 
K a( a .P) as shown in Equation 47. 
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K(a.p)= K 0 (a,p)/< a (a,p) 



Eqn. 47 



[0083] The sensitivity of the intensity to a particular aberration Zj can be minimized by setting the derivative of the 
intensity with respect to Zj to zero. By substituting Equation 47 into Equations 1 through 3 and taking the derivative of 
the intensity, the aberration sensitivity is minimized when h(a,p,m,n,p,q) in Equation 48 is equal to zero. 
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Eqn. 48 



Eqn. 49 

[0084] Equation 48 may be simplified and written as Equation 49. A weighting function w ab (a,p,m,n,p J q) is defined 
in Equation 50, which equates to 1 for areas (a,P) of the pupil that are least sensitive to Zj and to 0 for areas most 
sensitive to Zj. 

w ab (a,$,m,n,p,q) = 1 - l\h{a,$,m,n,p,q)\ Eqn. 50 

[0085] The weighting function to minimize ILS sensitivity to a particular aberration, Z,, can then be defined in Equation 
51. Furthermore, a weighting function to minimize ILS sensitivity to a particular aberration, Zj, and to maximize ILS 
through focus can also be defined in Equation 52. Either of these equations can be substituted into Equation 34 to 
calculate the illuminator with the optimal response to a given metric. 
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w(a.,$,m,n t p,q) = iv w/LS (m ) n I p,qf)iv a6 (a,p f /n,n,p I Q) Eqn. 51 

45 

w(a,p,/7?,n,p,q) = w NILS (m i n,p,q)w focus (a,p,m,n,p,Q)iv flb (a,p,m,ap,Q) Eqn. 52 



[0086] Figure 15 is a schematic representation of an example of a lithography apparatus for use according to the 
present invention. The apparatus includes a radiation system. The radiation system is made up of a lamp LA (which 
may be an excimer laser, for example) and an illumination system which may comprise beam shaping optics EX, an 
integrator IN, and a condenser lens CO, for example. The radiation system supplies a projection beam PB of radiation. 
For example, the radiation system may provide ultraviolet, deep ultraviolet or extreme ultraviolet radiation. In general, 
the radiation system may also provide soft x-ray or other forms of radiation. 

[0087] A first object table, or mask table MT holds a mask MA. The mask MA includes a pattern area C which contains 
the mask pattern to be imaged. The mask table MT is movable relative to the projection beam PB so that different 
portions of the mask may be irradiated. Alignment marks M 1 and M 2 are used for determining whether the mask is 
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properly aligned with the substrate, or wafer, W. 

[0088] A projection system PL projects the projection beam PB onto the wafer W. The wafer W includes two alignment 
marks P 1 and P 2 which are aligned with the marks M 1 and M 2 prior to beginning imaging. The wafer W is supported 
by a substrate table WT which is moveable relative to the projection beam for exposing different parts of the wafer W; 
in this way, the mask pattern C may be imaged onto different target portions c of the wafer W. An interferometric position 
monitor I F is used to insure that the wafer table WT is in the correct position relative to the position of the mask table MT 
[0089] While the invention has been described in connection with particular embodiments, it is to be understood that 
the invention is not limited to the disclosed embodiments, but on the contrary it is intended to cover various modifications 
and equivalent arrangement included within the scope of the claims which follow. 

Claims 

1. A method of optimizing an illumination profile for a selected patterning means pattern, comprising the steps of: 



defining a transmission cross coefficient function for an optical system including an illuminator and the selected 
patterning means pattern; 

determining relative relevance to imaging of diffraction orders based on the selected pattern; and 
calculating an optimized illumination configuration from the transmission cross coefficient function, weighting 
20 regions of the illumination configuration based on the relative relevance to imaging of the diffraction orders. 

2. A method as in claim 1 , wherein the step of determining relative relevance to imaging of diffraction orders further 
comprises the step of: 

25 determining a characteristic pitch of the selected mask pattern. 

3. A method as in claim 2, further comprising the steps of: 

identifying a critical region of the selected pattern prior to determining the characteristic pitch, 

30 

wherein determining the characteristic pitch for the selected pattern is performed by determining the char- 
acteristic pitch for the critical region. 

4. A method as in claim 3, wherein the step of identifying a critical region further comprises identifying a plurality of 
35 critical regions and wherein determining the characteristic pitch for the critical region includes: 

comparing a pitch of each identified critical region; and 

if the pitch of each identified region is substantially equal, determining the characteristic pitch for the critical 
region to be equal to the characteristic pitch of one of the identified regions. 

40 

5. A method as in any one of the preceding claims, furthercomprising weighting regions of the illuminatorconfiguration 
based on a selected optimized metric selected from the group consisting of depth of focus, end-of-line, image log 
slope (ILS), image slope (IS), and aberration sensitivity. 

45 6. A method as in claim 1 or 2, further comprising the steps of: 

identifying a plurality of critical regions; 

determining a pitch of each of the identified critical regions; and 

calculating optimized illumination configurations from the transmission cross coefficient function, weighing 
so orders based on relevance to imaging of diffraction orders for each critical region, and 

wherein calculating an optimized illumination configuration further comprises calculating a composite opti- 
mized illumination configuration based on the calculated optimized illumination configuration for each critical re- 
gion. 



7. A method as in any one of the preceding claims, further comprising the steps of: 

modifying the selected pattern by optical proximity correction techniques to reduce a total number of different 
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pitches in the mask pattern. 

8. A method as in claim 7, wherein the step of modifying the selected pattern by optical proximity correction techniques 
further comprises addition of sub-resolution features to the selected mask pattern. 

5 

9. A method as in claim 7, wherein the steps of modifying the selected pattern and calculating an optimized illumination 
configuration steps are iterated. 

10. A computer program for optimizing an illumination profile comprising program code means that, when executed 
10 on a computer system, instruct the computer system to carry out the steps of the method of any one of claims 1 to 9. 

11. A device manufacturing method comprising the steps of: 

(a) providing a substrate that is at least partially covered by a layer of radiation-sensitive material; 
15 (b) providing a projection beam of radiation using an illumination system; 

(c) using patterning means to endow the projection beam with a pattern in its cross-section; 

(d) projecting the patterned beam of radiation onto a target portion of the layer of radiation-sensitive material, 
wherein, prior to step (d), the cross-sectional intensity distribution in the projection beam produced in step (b) 
is tailored to the pattern employed in step (c) using a method according to any one of claims 1 to 9. 

20 

12. A lithographic projection apparatus comprising: 

an illumination system for providing a projection beam of radiation; 

a support structure for supporting patterning means, the patterning means serving to pattern the projection 
25 beam according to a desired pattern; 

a substrate table for holding a substrate; 

a projection system for projecting the patterned beam onto a target portion of the substrate, 
wherein the apparatus additionally comprises: 

30 

calculating means, for defining a transmission cross coefficient function for the illuminator and the patterning 
means, determining relative relevance to imaging of diffraction orders based on the pattern produced by the 
patterning means, and calculating an optimized illumination configuration from the transmission cross coeffi- 
cient function, weighing regions of the illumination configuration based on the relative relevance to imaging of 
35 the diffraction orders; and 

selecting means, for selecting the cross-sectional intensity distribution in the projection beam exiting the illu- 
mination system in accordance with the illumination configuration calculated by the said calculating means. 

13. „A method of optimizing a selected mask design comprising: 

40 

identifying critical features of the selected mask design; 

determining an optimized illumination profile based on diffraction orders of the critical features; and 
modifying the selected mask design by use of optical proximity correction techniques which are selected to 
reduce a number of pitches present in the selected mask design. 

45 

14. A method as in claim 11 wherein the optical proximity correction further comprises addition of sub-resolution fea- 
tures selected to modify a continuous probability density function of the space width of the selected mask design 
such that the modified probability density function has an increased discretization. 

50 15. a method as in claim 13 or 14 wherein said step of determining an optimized illumination profile comprises the 
steps of the method of any one of claims 1 to 9. 

16. A computer program for optimizing a selected mask design comprising program code means that, when executed 
on a computer, instruct the computer to carry out the method of any one of claims 13 to 15. 

55 
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Fig.5. 




Fig.6. 
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